Data Mining and Knowledge Discovery in Healthcare Organizations: A Decision-Tree Approach
نویسنده
چکیده
Health care organizations are struggling to find new ways to cut healthcare utilization and costs while improving quality and outcomes. Predictive models that have been developed to predict global utilization for a healthcare organization cannot be used to predict the behavior of individuals. On the other hand, massive amounts of healthcare data are available in databases that can be used for exploring patterns and therefore knowledge discovery. Diversity and complexity of the healthcare data requires attention to the use of statistical methods. By nature, healthcare data are multivariate, making the analysis difficult as well as interesting. In this chapter, our intention is to classify individuals that are future high-utilizers of healthcare. In particular, we answer the question of whether a mathematical model can be generated utilizing a large claims This chapter appears in the book, Creating Knowledge-Based Healthcare Organizations, edited by Nilmini Wickramasinghe, Jatinder N.D. Gupta and Sushil Sharma. Copyright © 2005, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. 701 E. Chocolate Avenue, Suite 200, Hershey PA 17033-1240, USA Tel: 717/533-8845; Fax 717/533-8661; URL-http://www.idea-group.com IDEA GROUP PUBLISHING Data Mining and Knowledge Discovery in Healthcare Organizations 79 Copyright © 2005, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. database that will predict which individuals who are not using a service in a yet untested database will be high utilizers of that health service in the future. For this purpose, an integrated dataset from enrollment, medical claims, and pharmacy databases containing more than 150 million medical and pharmacy claim line items and for over four million patients is analyzed for knowledge discovery. A modern datamining tool, namely decision trees, which may have a broad range of applications in healthcare organizations, was used in our analyses and a discussion of this valuable tool is provided. The results and managerial aspects are discussed. Several approaches are proposed for the use of this technique depending on the health plan. INTRODUCTION Many predictive models have been developed in healthcare in the past (Ash, 1999; Dunn, 1998; Dunn et al., 1995; Epstein & Cumella, 1988; Newhouse, 1986, 1995, 1998; van Vliet & Lamers, 1998; Weiner et al., 1995). However, for the most part, these models focused on how manipulation of plan design (deductibles, pays, etc.) will influence utilization behavior and to adjust for case-mix and risk for the purpose of predicting global costs and setting capitated reimbursement rates. Until recently, there has been little interest in applying prediction tools to individuals for the purpose of reducing costs and improving individual care. This lack of interest was mostly due to the absence of tools that can accurately predict future individual patient utilization, especially for patients who have had no current utilization. In general terms, current utilization of a particular kind of health service is the best predictor of future utilization of a particular kind of health service. Methods to predict future utilization of a particular service when there is no current utilization of the same service tend to produce results that are not meaningful for program managers. Today, with the rapid increase in the generation and collection of data, researchers are able to explore patterns hidden in large databases. Massive amounts of healthcare data are also available in databases that can be used for knowledge discovery. Diversity and complexity of the healthcare data requires attention to the use of statistical methods. By nature, healthcare data are multivariate, making the analysis difficult as well as interesting. The main objective of this research is to answer the question of whether a mathematical model can be generated utilizing a large claims database that will predict which individuals who are not using a service in a yet untested database will be high utilizers of that health service in the future. For this purpose we used a massive dataset containing more than 150 million medical and pharmacy claim line items and for over four million patients. This research differs from previous related studies in a number of ways: (a) The focus is on identifying individuals for targeted interventions who currently have no use of the service under study; (b) An integrated dataset from commonly available data found in enrollment, medical claims, and pharmacy databases is used; (c) A model that is built on more advanced “episodes of care” cost groupings rather than merely raw claims data; and (d) A modern data mining technique, namely a decision tree, is used for knowledge discovery. 11 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the product's webpage: www.igi-global.com/chapter/data-mining-knowledgediscovery-healthcare/7228?camid=4v1 This title is available in InfoSci-Books, InfoSci-Medical, Business-Technology-Solution, Technology in Healthcare, Communications, Social Science, and Healthcare. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=1
منابع مشابه
Modelling Customer Attraction Prediction in Customer Relation Management using Decision Tree: A Data Mining Approach
In Today’s quality- based competitive world, known as knowledge age, customer attraction is of ultimate importance. In respect to the slogan “customer is always right”, customer relation management is the core of an organizational strategy playing an important role in four aspects of customer identification, customer attraction, customer retaining, and customer satisfaction. Commercial organiza...
متن کاملUsing Combined Descriptive and Predictive Methods of Data Mining for Coronary Artery Disease Prediction: a Case Study Approach
Heart disease is one of the major causes of morbidity in the world. Currently, large proportions of healthcare data are not processed properly, thus, failing to be effectively used for decision making purposes. The risk of heart disease may be predicted via investigation of heart disease risk factors coupled with data mining knowledge. This paper presents a model developed using combined descri...
متن کاملApplication of Rough Set Theory in Data Mining for Decision Support Systems (DSSs)
Decision support systems (DSSs) are prevalent information systems for decision making in many competitive business environments. In a DSS, decision making process is intimately related to some factors which determine the quality of information systems and their related products. Traditional approaches to data analysis usually cannot be implemented in sophisticated Companies, where managers ne...
متن کاملA data mining approach to employee turnover prediction (case study: Arak automotive parts manufacturing)
Training and adaption of employees are time and money consuming. Employees’ turnover can be predicted by their organizational and personal historical data in order to reduce probable loss of organizations. Prediction methods are highly related to human resource management to obtain patterns by historical data. This article implements knowledge discovery steps on real data of a manufacturing pla...
متن کاملData Mining: A Novel Outlook to Explore Knowledge in Health and Medical Sciences
Today medical and Healthcare industry generate loads of diverse data about patients, disease diagnosis, prognosis, management, hospitals’ resources, electronic patient health records, medical devices and etc. Using the most efficient processing and analyzing method for knowledge extraction is a key point to cost-saving in clinical decision making. Data mining, sometimes called data or knowledge...
متن کاملData Mining and Knowledge Discovery: Applications, Techniques, Challenges and Process Models in Healthcare
Many healthcare leaders find themselves overwhelmed with data, but lack the information they need to make right decisions. Knowledge Discovery in Databases (KDD) can help organizations turn their data into information. Organizations that take advantage of KDD techniques will find that they can lower the healthcare costs while improving healthcare quality by using fast and better clinical decisi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015